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Abstract - The relative entropy distance of a state from an exponential family is important in infor- 
mation theory and statistics. The class of exponential families is parametrized by a Grassmannian. A 
minimal example in the algebra of complex 3x3 matrices shows that the mean value set of an exponen- 
tial family has typically non-exposed faces. Where non-exposed faces are born in the Grassmannian, 
families have a discontinuous entropy distance. 

These two phenomena are related to three distinct closures, which all coincide in the probabilistic 
case of the algebra C^. A necessary condition for a local maximizer of the entropy distance is calculated 
in a finite-dimensional complex matrix algebra. 
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1.1 Overview 



In this article we consider special submanifolds £ of state space S{A) for algebras A of observables. 

i— i A itself is called classical or probabilistic if it is abelian and quantum otherwise. These exponential 

—< families are images of the real affine subspaces of the algebra of observables under the trace-normalized 

exponential map. They are important in information theory, in statistics, in statistical physics and in 

neural networks, to name just a few fields. See Amari and Nagaoka [AN00] for an overview. 

It is well known that classical and quantum state spaces have quite different geometries. Whereas 
in the classical case we have a simplex (and thus every state is uniquely decomposed into pure states), 
i '** l i in the quantum case such a decomposition is highly non-unique (think of the Bloch ball S(A) for 

.A = Mat(2,C)). 

Still from the point of view of convex geometry there is one common property of all these state 
spaces: all of their faces are exposed faces 3 , that is, they can be described as the intersection of state 
space with a half space. 

In the probabilistic setting of A = C N measurement of observables /i, • • • ,/ n leads to an orthogonal 
l/^ projection 

S(A) — ► M'\ p .— ► (E P (/ 1 ),...,E P (/ B )) 

of state space, based on expectation E p . The image, called mean value set or convex support is no 
longer a simplex but still a polygon. So faces of a classical mean value set are exposed faces, too. The 
same applies to all exponential families and their natural projections, see Figure 1. 

We exhibit here two main differences between the abelian and the non-abelian case of an exponential 
S^ family, at least in our minimal example. 

1. First, in the quantum case it is typical in a Grassmann manifold parametrizing the families that 
mean value sets of an exponential family have non-exposed faces. 

2. Second, the entropy distance from an exponential family defined in (7) can be discontinuous in 
exceptional cases. The boundary in the Grassmann manifold dividing non-exposed faces from 
exposed faces seems pivotal for discontinuity. 
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3 non-exposed faces are found, e.g., on the circumference of a stadium, at the four points where a half-circle 
meets a segment. See Section 2.2 for precise definitions. 





Figure 1: Mean value sets for two probabilistic exponential families. Left: triangle; right: 
square. 



We build on the work of Wichmann, who studied ensembles of maximum chaos in A = Mat (AT, C) 
in 1963. We borrow from this source the mean value chart, which we will introduce in Section 1.2. 
However, Wichmann ignored the existence of non-exposed faces at a quantum mean value set: Theorem 
I (e) in [Wi63] concerning extreme points is wrong, it fails in all cases where non-exposed faces appear. 

Non-exposed faces are not a minor issue to the study of entropy distance. First, the simplest 
possible example of a metamorphosis between a classical (simplicial) and a quantum mean value set 
convinces that non-exposed faces are typical. This is depicted in Figure 2 and explained in Section 2. 
Second, the metamorphosis contains one shape that divides mean value sets with non-exposed faces 
from mean value sets without non-exposed faces. This shape corresponds to the Staffelberg family in 
Section 3.2 having a discontinuous entropy distance. In this example the convex geometric notion of 
non-exposed face indicates discontinuity of the entropy distance. 

Another reason to study the above-mentioned phenomena are their counterparts in the structure 
of extensions of exponential families. These have been developed in the classical context of a Borel 
measure on R d by Csiszar and Matus [CM03, CM05]. For a measure of finite support their results 
become rather easy and were already investigated by Barndorff- Nielsen [Ba78] and Cencov [Ce82]. The 
probabilistic case of finite support corresponds to the algebra A = C N where the following extensions 
all coincide. 

Three extensions of £ have been worked out by Weis [We09] in the context of a *-subalgebra A of 
Mat (AT, C). The reverse information closure or rl-closure 

cl rI (£) := {peS(A)\d £ (p)=Q} 

consists of states that approximate £ in relative entropy distance (7) and the norm closure 4 £ is taken 
in the Hilbert-Schmidt norm topology of A. These closures were used by Csiszar and Matus [CM05] but 
the inclusion cl r i(£) C £ holds also in the quantum case. It arises from the Pinsker-Csiszar inequality, 
see e.g. Petz [Pe08]. To our best knowledge the geodesic closure 5 

cl goo (£) := £ U { limit points of e-geodesics in £ } (1) 

is new in the literature. In Section 3 we prove the inclusions 

cl goo (£) c cl r i(£) c£. (2) 

Equality conditions in Corollary 7.7 and Theorem 7 in [We09] say that cl geo (£) = cl r i(£) if and only if 
the mean value set has no non-exposed faces and cl r i(£) = £ if and only if dg is continuous. Returning 
to the classical case of A — C N the equality cl gco (£) = £ holds (see Proposition 8.2 in [We09]) and 
we arrive at the continuity of dg proved by Ay [Ay02]. 



4 ln probability theory, this is often called variation closure and taken in L 1 norm with respect to a dominating 
measure but in finite dimensions the norm does not matter. 
5 A one-dimensional exponential family is called e-geodesic. 










Figure 2: A classical — quantum metamorphosis of 2D mean value sets in the algebra 
Mat(2,C) © C. On the left we have the triangle 5(C 3 ) followed by three shapes with non- 
exposed faces and with a corner of decreasing distinctness. These are followed by three ellipses 
including on the right a projection of the Bloch ball <S(Mat(2,C)). The fifth mean value set 
divides these with a corner from the ellipses. It belongs to the Staffelberg family, which has a 
discontinuous entropy distance. 



An example of an exponential family with non-exposed faces but, contrary to the Staffelberg family, 
with a continuous entropy distance is provided in Section 3.3 with the swallow family. 

Directional derivatives in Section 4 provide a necessary optimality condition. A local maximizer of 
d,£ proves to be completely determined by its projection onto £. This is a quantum analogue of the 
probabilistic case, where a local maximizer is the truncation of its projection. 

1.2 The advantage of mean value sets 

The entropy distance from an exponential family can be a measure of information, see e.g. Ay and 
Knauf [AK06] and references therein — an example is multi-information, which measures stochastical 
dependencies between units of a composite system. We recall how the Pythagorean theorem of relative 
entropy defines a state of minimal relative entropy on an exponential family. This leads us to the mean 
value chart and to mean value sets. 

For N G N we denote by A a *-subalgebra of Mat(iV,C) with identity 1. We write a* for the 
adjoint of a G A and A sa ■= {a G A \ a* — a} for the real vector space of self-adjoint matrices. 
The Hilbert-Schmidt inner product of a, b G -4 S a is (a,b) — tr(ab). A matrix p G A is an orthogonal 
projector if p 2 = p* = p. A matrix a G A is positive, written as a > 0, if a* = a and a if has no 
negative eigenvalue. The state space of A is S(A) :— {p G A sa \ p > 0, tr(p) = 1}, its elements are 
states. 

The space of invertible states is S(A) := {p G S(A) \ p^ 1 exists }. If A ^ {0} then the 
trace-normalized exponential is the real analytic mapping 

exp! : A sa — > S{A), a i — ► ^y . 

This is a diffeomorphism onto S(A) while restricted to traceless matrices. The real analytic inverse 
lno : p H> ln(p) — ltr(ln(/9))/tr(]l) is the canonical chart of S(A). The image of a non-empty affine 
subspace of A sa under expj is an exponential family in A. For an exponential family £ we call ln \e 
the canonical chart of £ and the affine space 9 := liio(£) the canonical parameter space of £. The 
translation vector space V of 9 is the canonical tangent space of £. Sometimes we consider linear 
exponential families with 9 = V to use preparatory work done by Wichmann [Wi63]. 

The relative entropy between states p,a G 5(^4) is S(p,a) := +oo unless a has the larger image 
Im(cr) D Im(p) and then (using the natural logarithm) 

S(p,a) := trp(ln(p)-ln(a)). (3) 

The distance-like properties of S(p.a) > and of S(p,o~) = <=>■ p = a hold, cf. Wehrl [We78]. 

The relative entropy is very well compatible with exponential families. If p, a and r are states in A 
with a and r invertible, and if p — a _L ln(r) — ln(cr), then we have 

S(p,a) + S(a,T) = S(p,r). (4) 



This is the Pythagorean theorem of relative entropy 6 . It is easy to see that the Pythagorean theorem 
(4) holds if <7 and t belong to an exponential family £ in A and if p E S(A) satisfies p — a _L V. 
Therefore we have a projection for states p E £ + V 1 - defined by intersection Trs(p) = (p + V- 1 ) n £ 
understood as the element of a one-element set. This projection satisfies 

inf. S(p,<r) = mm S(p,a) = S(p,TT £ (p)) (5) 

ctE£ ctGc 

and it extends to the cylinder £ + V 1 - of self-adjoint matrices 

n £ : £ + V ± — ► £ , a i — >(a + V J -)r\£. (6) 

The infimum of relative entropy distance (3) between a fixed state and members of the exponential 
family £ is called the entropy distance from £ 

d £ : S(A) — ► R, p >—> inf S(p, a) . (7) 

This is a non-negative and bounded real function on S(A). 

The question arises how large the orthogonal projection of £ to V is. Wichmann has solved 
this problem in the context of maximum entropy ensembles under linear constraints in Theorem II 
of [Wi63]. We cite the relevant part of this theorem starting with linear independent observables 
I, Xi, . . . , X p E A sa - The set of simultaneous mean values consists of the p-tuples of real numbers 
W := {((X 1 ,p),...,(X p ,p)) | pE S(A)}. There is a mapping 



and a second mapping 



u : R p — ► S(A), y .— > ex Pl (ELi^^ 



A^—tW, u^-> ({X u u),...,{X p ,u)), 



such that xou! : M. p — >• W p , j/4 x(u>(y)) is real analytic, xoui is injective and onto the interior of the 
set of simultaneous mean values W. 

We use Wichmanns results in a coordinate free version. They apply to a linear exponential family 
£ with canonical tangent space V = 8 = ln (£ ) as follows. The orthogonal projection ny : A sa — > V 
onto V with respect to the Hilbert-Schmidt inner product defines the mean value set 

mv(V) := ir v (S(A)). (8) 

The mean value chart for £ is the bijection 

n v \ £ : £ — > Int(mv(V)) (9) 

onto the interior of the mean value set (open in the topology of V). The chart change nv\e ° e x Pi \v 
from the canonical chart to the mean value chart is real analytic. 

Moreover, Wichmann proved in Lemma VI, [Wi63], that the mapping a; o w has a positive definite 
Jacobian. This translates to the positive definite Jacobian of ttv\s ° ex Pi \v ar| d then the real analytic 
inverse function theorem proved by Krantz and Parks [KP02] implies that the opposite chart change 
m o If ° (titHs) -1 is rea l analytic. Composition with exp 1 from the left and -Ky from the right yields 
the real analytic projection ng = (ttvIs)^ 1 ° ^v\e+v ± defined in (6). 

2 A classical — quantum metamorphosis 

Non-exposed faces appear only in the projection of a state space. So, in a sense, the convex geometry 
of a mean value set is more complicated than the geometry of a state space. We discuss an example 
where non-exposed faces are typical. This is made precise in the final discussion of the section. The 
example is minimal in two respects: 



6 ln more sophisticated terms it states (see e.g. Petz [Pe08]): if the segment [p, it] and the e-geodesic from 
utOT meet at a orthogonally with respect to the BKM metric, then (4) holds. 



• The algebra A := Mat(2,C) © C is the smallest algebra that allows for a mean value set with 
non-exposed faces. If an algebra has no subalgebra isomorphic to Mat(2,C), then it is abelian 
and has a simplex as a state space — the mean value sets are polytopes without non-exposed 
faces. The algebra Mat(2,C) by itself has the 3D Bloch ball as state space, whose projections 
are a point, a segment or a disk. 

• On the other hand, for the projection shape we consider the minimal dimension two that allows 
for non-exposed faces. 

We denote the identity resp. zero in Mat(2, C) by I2 resp. O2. The Pauli cr-matrices are rri := I „ 

(7 2 :=( n ) and <T3 :=( „ ). We use the vector of Pauli cr-matrices a := (a±, 172,(73) and 

for b = (bi, &2) ^3) € M 3 we write ba = b\&\ + b^cti + &3O3. 

2.1 A cone substitute reduces dimension 

The state space of A :— Mat(2, C) © C of our minimal example is a 4D cone with the Bloch ball as its 
base: 7 

S(A) = conv (S(Mat(2, C)) © 0, 2 © l) . 

In order to sharpen our geometric insight and simplify arguments we want to further reduce the dimen- 
sion. Indeed we can replace it by a 3D cone. The reason is that all 2D linear exponential families are 
contained in a 3D cone, which may, by an automorphism, be assumed to be in the form (14) below. 

The special unitary group SU(2) acts in a double cover SU(2) — > SO(3) on span R {cri, (72, 03} by ro- 
tation (see e.g. Knorrer [Kn06]). The group SO(3) acts transitively on the 2D planes in spanglo^, a 2: cr^i 
0. We choose the reference plane W := span R {<7i, (72} © 0. 

Let us write z := — 5I2 © 1- We denote the space of traceless self-adjoint matrices in A by 
X := span R {(7i,CT2, 173} © + M.z, we denote the Grassmannian manifold of 2D planes in X by G. 
The group SU(2) acts on X with the identity mapping in the second summand, and it acts on the 
Grassmannian G. It is clear that every 2D plane V G G is included in VV_+ M.z for a 2D plane 
W C span R {(7i, CT2, 173} ©0. By the transitive action of SU(2) on these spaces W, every orbit of SU(2) 
on G meets a 2D plane of 

U := W + Rz. (10) 

Therefore, if SO(2) acts on U by rotation of W, we have the orbit isomorphism 

G/SU(2) = { 2D subspaces of U } /SO(2) . (11) 

A complete orbit invariant is the angle between a 2D plane V £ G resp. V C U and the vector z 

<p = <p(V) := 4{V,z). (12) 

Using (11) we confine attention to 2D subspaces V C U and substitute the 4D cone S(A) by a 3D 
cone. We begin with the 2D base disk K := S(A) H (|fl 2 © + W) of diameter \J2 centered about 
|l2 © 0. We use the vector b(a) := (sin(a), cos(a), 0) and describe the base circle of the base disk K 
by the pure states 

p(a) := l(l 2 + b(a)a)®0, «eR. (13) 

We build on K the 3D cone with apex C^ffll^^ffiO + z 

C := conv(K, 2 © l) . (14) 

The cone C is rotationally symmetric about the line / through |l2 © and O2 © 1. Every plane in 
5I2 © + U that contains I meets C in an equilateral triangle, whose center is the tracial state |l. 
The triangle is isometrically isomorphic to the state space of C 3 . 
The cone C is a threefold paradigm of the state space. We have 

7ru(S(A)) = C-\t, (15) 



7 conv denotes the convex hull. 



S(A)n [|1 + 17] = C (16) 

and 

C = ex Pl (C7). (17) 

The proofs of these equations are straight forward. In this section only (15) is important. But since C 
is the closure of an exponential family, we can study in the next section exponential families inside of 
C by taking the intersection (16). 

2.2 Tangent lines to ellipses produce non-exposed faces 

We use two distinct concepts of face to describe the convex geometry of a mean value set. We apply 
these terms to mean value sets on planes of the Grassmannian G discussed in Section 2.1. The second 
and more general idea about a face will be introduced in two (equivalent) forms. 

Let M be a non-empty compact and convex subset of a Euclidean vector space (E, (•, •)). Then for 
non-zero u £ E the supporting hyperplane is defined by 

H(M,u) := {x EE\ (x,u) =m&x(y,u)} . 

In two dimensions dim(E) = 2 this is a line I that meets M such that M \ I is convex. A subset F of 
M is an exposed face of M if F is the intersection of M with a supporting hyperplane 

F(M,u) := Mr\H(M,u). (18) 

F = and F = M are exposed faces by definition. 

An important example is the state space of a *-subalgebra A of Ma,t(N, C). For a self-adjoint 
matrix u £ A sa we denote by /i+(u) the maximal eigenvalue of u and by p+{u) the spectral projector 
of u corresponding to p+(u), which we call the maximal projector of u. 

Lemma 2.1. If u £ „4 sa is a non-zero self-adjoint matrix, then the exposed face F(S(A),u) consists 
of the states p £ S(A) such that (p,u) = p+{u) or, equivalently, Im(p) C Im(p + (u)). 

Proof: This follows from the general situation in a C*-algebra, see e.g. Alfsen and Shultz [AS01], 
and it is easy to verify this directly for A, see e.g. Weis [We09]. □ 

Remark 2.2. The exposed face F(S(A),u) in Lemma 2.1 is the state space of the algebra pAp for 
p := p + {u) discussed below (22). 

Griinbaum [Gr67] calls a subset F of M a poonem, provided there exist sets Fo, . . . ,F n such that 
F = F, F n = M, and Fj,_x is an exposed face of F t for i = 1, . . . ,n. A convex subset F of M is a 
face of AI , if for all x,y £ M and all < A < 1 the inclusion of (1 — X)x + \y £ F implies x, y £ F. 
A face of M , which is not an exposed face of M will be called a non-exposed face of M, see Figure 3. 

Remark 2.3. For a convex set of finite dimension the concept of poonem is equivalent to the concept 
of face. This is easy to prove by induction taking an infimum in the complete lattice of exposed faces, 
see Lemma 5.30 in [We09]. The concept of face is more popular, see e.g. Rockafellar [Ro72], but the 
concept of poonem is more suitable to study entropy distance, see Remark 3.13. 

We return to the Grassmannian G discussed in Section 2.1 and consider a 2D subspace V of the 
space U defined in (10) having an angle of ip = ^.(V, z) with the vector z. By (15) the mean value set 
is 

mv(V) = tt v (S(A)) = tt v (C) 

for the 3D cone C = conv(K, O2 © 1). Here K is a disk with symmetry axis in the direction of z. The 
mean value set is the convex hull of k := ttv(K) and of the projection x := ttv(02 © 1) of the apex of 
C, 

mv(V) — conv(k,x) . (19) 
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Figure 3: A non-exposed face. The drawing shows a repeated inclusion of exposed faces. The 
segment is an exposed face of the convex surface and the point is an exposed face of the 
segment. But the point is a non-exposed face of the convex surface. 



Lemma 2.4. Let V £ G be a 2D plane. Then x £ k if and only if | < ip < | . If ip = 0, then the 
mean value set mv(V) is a triangle. IfO < ip < ^, then k is an ellipse and the tangent points of the 
two tangent lines from x to k are non-exposed faces ofmv(V). If | < <p < §, then mv(V) = k is an 
ellipse. 

Proof: By the orbit isomorphism (11) we assume V C U and we use (19). Instead of the passive 
view of a moving plane, we imagine a rotating cone C against the horizontal plane V. The symmetry 
axis I of C through |l 2 © and 2 © 1 follows the direction of z. We fix a vertical plane through I. 
This meets C in an equilateral triangle A. The side b of A, which is not incident with the apex O2 © 1 
of C runs in the base disk K of C and b is therefore pivotal to decide about x £ k. This happens if 
and only if b covers the other two sides of A, so x £ k if and only if ?£(V,b) < f ■ As I and b are 
perpendicular, we have x £ k <=> ^ < ip < | . 

At ip = the symmetry axis I of the base disk K of C is parallel to V, so k — ttv(K) is a segment 
and the mean value set mv(V) — conv(k,x) is a triangle. Otherwise, if ip > 0, then k is an ellipse. 
The case of ^ < ip < § with x £ k reduces to mv(V) — k. In the case < ip < f with x <=£ k there 
are two tangent lines from x to the ellipse k with tangent points ti,t2,£ k. A supporting hyperplane 
that meets t. b must include the tangent line segment [a;, U], so {t{\ is not an exposed face of mv(V) 
for i = 1,2. On the other hand ti is an endpoint of the exposed face [x,ti] and hence is a poonem 
(or a face likewise) of mv(V) for i = 1,2. We conclude that {£1} and {t 2 } are non-exposed faces of 
mv(V). D 

In Figure 2 mean value sets mv(V) are drawn at the angles tp £ {^ | k = 0, . . . ,6}. The third 
drawing at </2 = -| is also used in Figure 3 to demonstrate non-exposed faces. 

The results of this section show that non-exposed faces of a mean value set are typical in the 
following sense. A continuous curve 7 : [0, 1] — > G induces a curve of mean value sets A i-> mv(7(A)). 
By Lemma 2.4 a mean value set without non-exposed faces must be a triangle or an ellipse. If 7 
connects the classical mean value set of a triangle to an ellipse, then we have 2^(7(0), z) = and 
2^(7(1), z) £ [^, |-]. Since the angle ip is continuous on G, the curve 7 must cross the range of angles 
(0, |) with mean value sets having non-exposed faces. This range corresponds to an open subset of 
the Grassmannian G. 

3 Closures of exponential families 

In the sequel of this article we assume that A is a *-subalgebra of Mat (AT, C) and that £ is an exponential 
family in A with canonical parameter space 9 and canonical tangent space V. The previous section 
has revealed for two-dimensional mean value sets in the algebra Mat(2,C) ©C that an angle of (p = ? 
divides mean value sets with non-exposed faces from others without non-exposed faces. We show that 
the Staffelberg family at <p = f has a discontinuous entropy distance. We compute closures of the 
swallow family and of the Staffelberg family based on a few general results. 

3.1 General results 

In this section we prove a set-theoretic upper bound for the norm closure £ of the exponential family 
£. We describe the geodesic closure of £ as a union of exponential families in compressed algebras. 



We compute the entropy distance from e-geodesics and their limit points. 

First we focus on the norm closure. There is an upper bound on the supplementary set £\£. This 
depends only on the canonical tangent space. 

Lemma 3.1. Let £ be a linear exponential family in A and assume (pi) c £ is a sequence of states with 
norm limit p = lim^oo pi £ S(A). Either p £ £ or there exists a non-zero vector v in the canonical 
tangent space V of £ , such that p belongs to the exposed face F(S(A),v) of the state space. 

Proof: If the sequence 9i := lno(pj) of canonical parameter values has a bounded subsequence, 
then this subsequence has an accumulation point 9 £ V = mo(£) and then by continuity of exp x 
we have p = exp 1 (0). Otherwise lirn^co ||#;|| = oo and then there exists a converging subsequence 
(w.l.o.g. we keep {9{) and (pi)), such that v := lim^oo J^w £ V. An argument from Lemma VII in 
[Wi63], which can be proved by standard perturbation theory (see Corollary 6.26 in [We09]), is 

(p,v) = lim (pi,v) = p+(v). 

Now Lemma 2.1 proves that p £ F(S(A),v). □ 

An e-geodesic is a one-dimensional exponential family. The free energy, defined for a £ A sa by 
F(a) := ln(tr(e a )) is a useful function to discuss the structure of limit points of e-geodesics included 
in an exponential family. 

Lemma 3.2. Suppose 9, u £ _4 sa and p := p+(u) is the maximal projector of u. We have 

pe p6p 

lim exp, (9 + X u) = — -. 5— (20) 

A^oo ilv ; ti(peP 8 P) K ' 

and 

lim (F(6 + \u) -Xfx + (u)) = In (ti{pe pep )) . (21) 

A— >oo 

Proof: If u has maximal eigenvalue p+(u) = then limA^ooe e+A " = pe p6p . This equation is 
proved by standard perturbation theory. By the invariance for a £ R of exp 1 (0 + al) = exp 1 (0), one 
obtains (20). The limit liniA->oo e 0+x u = pe p0p implies (21) through the equivariance for a e t of 

the free energy F(9 + at) = F(9) + a. □ 

The only extension of the exponential family £, that we can compute in general in this article, is 
the geodesic closure 

cl g co(£) — £ Li { limit points of e-geodesics in £ } . 

We show that cl gco (£) is a union of exponential families. If p G A is a non-zero orthogonal projector, 
then the compressed algebra by p is 

pAp := {pap I a £ A} . (22) 

The algebra pAp is a C*-subalgebra of A with identity p. The Hilbert space representation induced 
from A makes the embedding pAp —> A trace-preserving. We denote functions on pAp that depend 
on the representation by a superscript p, e.g. \n p (p) = 0, while ln(p) is not defined. For a £ pAp we 
notice exp p (a) — pexp(a), exp p (a) — tT pe ea ) and F p (a) = lntr(pe a ). The projection 

c p : A — ► pAp, a 1 — >pap-p t ^^ 
is important to define the exponential family in pAp 

£ p := {exp p oc p (9) \ £ 9} . 
This has canonical parameter space c p (&) and canonical tangent space c p (V). 
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Proposition 3.3. Suppose £ is an exponential family in A. The geodesic closure of £ is 

clgeo(£) = U p £ P , 

where the disjoint union extends over the maximal projectors p = p+(v) of all vectors v £ V. 

Proof: This follows from (20) in Lemma 3.2. □ 

The rl-closure of the exponential family £ was computed by Weis [We09]. In this article we repeat 
this in an elementary way for the Staffelberg family and the swallow family. The rl-closure of £ is 
defined by 

cln(f) - { P eS(A)\d £ (p) = 0}. 

We relate the geodesic closure to the rl-closure using asymptotics of the entropy distance de along 
e-geodesics. By Dyson's expansion we can arrive at the derivative of the exponential, see e.g. Bhatia 
[Bh97]. For a, b £ A this is 

D| Q exp(6) = £e a+xb \ x = = /„* e^be^-v^dy . 

Then we get for the free energy F the derivative 

B\ a F(b) = (&,ex Pl (a)>. (23) 

For p, a £ S(A) we denote the relative entropy S(p,<r) defined in (3) by S p (cr). In the following proof 
we control the limit of S p along an e-geodesic. This is remarkable, since on the subset of S(A), where 
the relative entropy is finite, it is not continuous but only jointly lower semi-continuous (see [We78]). 
This happens already for the algebra C 2 of a bit. 

Lemma 3.4. Suppose 0, it £ A sa and u is not proportional to the multiplicative identity 1 in A. If the 
state p belongs to the exposed face F(S(A),u) of the state space, then S p (exp 1 (9 + An)) is strictly 
monotone decreasing with A £ R and 

inf S p (cxp 1 (9 + Ait)) = S p ( lim exp x (0 + A it)) . 

A£S£ A— >oo 

Proof: By definition (18) of an exposed face we have for r £ S(A) the inequality (u, r — p) < 
because p belongs to F(S(A),u). An invertible state r has the image Im(r) = Im(]l) (/_ Im(p + (it)) 
and hence r $• F(S(A),u) by Lemma 2.1. This implies the strict inequality (u,T — p) < 0, which 
holds for the invertible states r = cxp 1 (8 + Ait) with AeM. Using (23) we have for all A e K 

^S p o exp 1 (9 + Xu) = (u,exp 1 (9 + \u) — p) < 0. 

We conclude that S p o cxp 1 (9 + \u) is strictly monotone decreasing in A. 

We prove the infimum-limit equality. Let us consider the e-geodesic g : A H> cxp 1 (6' + Ait). The 
limit for A — > oo of g is calculated in (20), we set p := p+{u) for the maximal projector of it and 

cr := limA^oo g(X) = expf(p0p). 

The states p and a belong to the compressed algebra pAp defined in (22) and a is invertible in pAp. 
Then 

-S(p,a)-S(p) = ti-{p\n p oexp p 1 ( P 8p)) = tr(p0) - FP(p9p) 
= lim A _>oo [tr(p9) + Xp+(u) - F(9 + Am)] 
= liniA^oo [tr(/j(0 + Au)) - F(9 + Xu)] 
= lirriA->.oo tr(/o lnoexp 1 (0 + Am)) = limx^cc [~S{p, g{\)) ~ S(p)} . 

We have used (21) in the third step. The result is limA->oo S p og(X) — S p (<r). Since S p og is monotone 
decreasing in A we have inf AeR S p o g(X) = S p (a). D 

We can relate the geodesic closure to the entropy distance and to the norm topology. 



Proposition 3.5. Suppose £ is an exponential family in A with canonical tangent space V. lfv£V 
is non-zero and p belongs to the exposed face F(S(A),v), then ds(p) = d £P+ ( V )(p). In particular we 
have d £ (p) = inf CTec i gso ( £ ) S(p,a) and cl gco (£) C cl,i(£) C £. 

Proof: We prove the first statement, let p :=p + (v). If p+(v) = 11, then there is nothing to prove. 
Otherwise v is not a multiple of 11 and then we have by Lemma 3.4 and Lemma 3.2 

d £ (p) = miS(p,a) = inf inf S{p,exp l {6 + Xv)) 

cr£c 6£Q AGlR 

= MS(p, lim ex Pl (0 + Av)) = inf S(p,expP o c p (6>)) = d £P (p). 

0€© A— >oo #GO 

Now the second statement and the inclusion cl goo (£) C cl r i(£) follow from Proposition 3.3. 

The Pinsker-Csiszar inequality, proved e.g. in the book by Petz [Pe08], says that \\p— <j||f < \S(p, a) 
holds for all states p, a £ S(A) with the trace norm indicated by || ■ ||i. This implies the inclusion of 

ci rT (£) c£. a 



3.2 The Staffelberg family 

The Staffelberg family £ corresponds to an angle of ip = -| in (12), the special point in the metamor- 
phosis of Figure 2. We calculate closures of £ and we prove cl geo (£ ) = cl r i(£) C £. This implies that 
the entropy distance from £ has a discontinuity. 

Definition 3.6. The Staffelberg 9 family is the linear exponential family in A := Mat(2,C)©C defined 
by 

£ := expj (span R {ui © 0, cr 2 ® 1}) • 

The canonical tangent space V = 6 of £ is spanned by v\ := cr\ © and v-i := u% © 1 — |l- The 
vector z = — ifl 2 © 1 is perpendicular to v\, so 

tp = K(V,z) = K(v2,z) — arccos(i) = |. 

The Staffelberg family £ is included in the 3D cone C introduced in (14). We use the notation 
B := {p(a) | a £ (0, 2tt)} for the punctured base circle of C with p(0) = h(^2 + 02) © missing. In 
Corollary 3.9 we prove that d £ is discontinuous at p(0) because B C c\ r \(£) while ^(^(O)) = ln(2). 

The symmetry axis / of C goes through the tracial state |l and the apexf^ffil, where the generating 
lines of C meet with I under an angle of |. The generating line [p(0),02 © 1] is perpendicular to V. 
We denote by c := |(p(0) +O2 © 1) the midpoint. The basis vectors of V connect special points in C, 

v x = CTi © = p(f ) - p(§7r) and u 2 = cr 2 © 1 - |l = | (c - p(tt)) . 

The algebra generated by ct 2 © 1 is isomorphic to C 2 and it has the segment [p(ir), c] as its state space. 
The e-geodesic {exp 1 (A(cr 2 ©l)) | A £ M} is included in £ and it covers the invertible states in [p(ir),c\. 
The Staffelberg family is depicted in Figure 4. 

Remark 3.7. Let us discuss a polar parametrization of the Staffelberg family £. The zero trace of V\ 
and V2 above is not important. We use 

u(o) ■— sin(a)cr 1 ©0 + cos(a)(cr 2 © 1), ael. (24) 

We need for non-zero tet 3 and \b\ := ^Jb\ + b\ + b\ the formula in Mat(2,C) 

exp(6a)= coehfl&pa + sinhfl&l)^ a. 

Writing b(a) = (sin(a),cos(a), 0) we parametrize £ for a £ R and t > by 

a(a,t) ■- ex Pl (u(a)t) = } \(1 2 coshft) + b(a)a sinh(t)) © e cos(a)t l . (25) 

T(a, t) L J 



8 That exponential family has the form of the Staffelberg mountain, in the natural preserve of Veldensteiner 
Forst. 
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p(0) 



P(0) 



Figure 4: An e-geodesic model of the Staffelberg family £. Left: The elliptic mean value set 
below indicates the alignment of the canonical tangent space V. The cone C containing £ has 
apex O2 © 1 and the generating line [/?(()), O2 © 1] (black, thick) is perpendicular to V. The 
open segment (black, dashed) from the midpoint c of this generating line through |i to p(ir) 
belongs to £. The large equilateral triangle is a section through the symmetry axis of C, it is 
isometrically isomorphic to the state space of C 3 . Right: The geodesic closure cl geo (£) covers 
the punctured base circle of C (large circle) with p(0) (small circle) missing. The geodesic 
closure also includes c (thick). The segment [p(0),c] (dashed) belongs to the norm closure £. 



(a(a,t),v 3 ) = 



-(- sinh(i) cos(a) - cosh(i) + e tcos ^) 



(26) 



Here T(a,t) := 2cosh(i) + e cos ( Q )' js for normalization. The vectors v\ and v 2 are completed by 
v 3 := -p(0) © 1 to an orthogonal basis of U = V + Rz. We obtain for all a S M and i > 

1 
T{a,t)' 
We discuss closures of the Staffelberg family and its entropy distance. 

Theorem 3.8. The Staffelberg family £ has geodesic closure and rl-closure equal to cl g0 o(£) = 
cl r i(£) = £UBU{c}, the norm closure is £ = £l)Bl)[p(Q),c]. The entropy distance of p e [p(0),0 2 ffil] 
from £ is dg{p) — S(p,c). 

Proof: We calculate the geodesic closure c\ goo (£ ) using Proposition 3.3 as a union of exponential 
families £ q = {exp'oc 9 (6') | 9 <E V} for maximal projectors q. In place of the maximal projectors of 
v ^ in V we consider the maximal projectors of the vectors {u(a) \ a £ R} in (24). We distinguish 
two cases depending on the spectral projectors in the orthogonal sum 

u(a) — p(a) — p(a + it) + O2 © cos(a) . 

The maximal eigenvalue of u(a) is constant one. If a 7^ mod 2ir, then the maximal projector of u(a) is 
p(a). This has rank one and £PW — {p(a)}. We have proved B c cl gco (£). If a = mod 2tt, then the 
maximal projector of u(0) \sp := p(0) + 02©l = 2c. The projections p(eri©0)p = and p(o"2©l)p = P 
show that the canonical parameter space of the exponential family £ p is c p (8) = c p (V) — {0}. We 
get £ p — {c} and we conclude cl goo (£) = £ U B U {c}. 

For a discussion of the norm closure we consider exposed faces as proposed in Lemma 3.1. This 
provides an upper bound on £ in terms of exposed faces by vectors v E V. These faces depend only 
on the maximal projector of v by Lemma 2.1, so we have 

£ C £ U\J aeR F(S(A),u(a)). 

We start with a one-dimensional face. The segment [p(0),0 2 © 1] is the exposed face F(S(A),u(0)) 
and we have p + (u(0)) = p. The fact £ p — {c} and Proposition 3.5 show for p £ [p(0), O2 © 1] 



de(p) = d £P (p) = S(p,c). 



(27) 
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For a 7^ mod 2-zr the maximal projector p(a) of u(a) has rank one and the exposed face is 
F(S(A),u(a)) = {p(a}}. We obtain 

£ c £ U B U [p(0),0 2 ©l]. 

Since 5 C cl gco (£ ) Proposition 3.5 shows B C £. We finish the calculation of £ by showing that 
exactly the half [p(0), c] of the segment [p(0), O2 1] belongs to £ . 

We prove that at most the half segment [p(0),c] belongs to £ by showing that £ lies completely 
in one of the closed half spaces of |]1 + U divided by the hyperplane c + V. This works because the 
vector W3 defined in the paragraph after (25) is orthogonal to V and 

(p(0),v 3 ) = -1, (c,v 3 ) =0, and (O 2 0l,u 3 ) = 1. 

It remains to prove (cr, V3) < for all states a in £. In the polar parametrization (25) of £, R x M^ — > £, 
(a,i) h-> cr(a,t) the denominator T(a,t) of (tr(a,i),W3) is strictly positive (26). So we can prove for 

a e R, t > and 

z(a,t) := T(a,t){a(a,t),v 3 ) = -cos(a) sinh(i) - cosh(i) + e cos(a) * , 

that z(a, t) < holds. For all a E M we have z(a, 0) = and for all a e K and i > 

J^z(a,t)±z(a,*) = (cos(a) ± 1) [e cos ( Q )* - e ±4 ] < 

implies -gjz(a,t) < and by integration from to i we get z(a, i) < 0. 

We show [p(0),c] C £. The state p(0) lies in the closure of B and c is the limit of an e-geodesic 
in £. We still have to approximate for A e (0, 1) the state r(A) := (1 — f)p(0) | from within £. For 

t>0we choose a(i) := J\ ln(^) with lim^oo a(t) = and lim^oo e ( cos ( Q (*))-i)* = _A_. jh 



en 
we expand numerator and denominator of a(a(t),t) defined in (25) by the factor e~ l and we get 

lim^aHt),;) = ^g±ggjEx = r(A) . 

Finally we calculate the rl-closure. This is sandwiched by Proposition 3.5 between geodesic and 
norm closures 

£UBU{c} c cl rI (£) C £UBU[p(0),c\. 

It remains to discuss states p E [p(0),02®l]. We use de(p) = S(p,c) proved in (27), whence ds{p) = 
implies p = c. □ 

Corollary 3.9. The entropy distance de from the Staffelberg family is discontinuous at p(0). 

Proof: By the previous theorem we have de(p(0)) = S(p(0),c) = ln(2) while dg = on the 
punctured base circle B of the cone C. This includes p(0) in its norm closure. □ 



3.3 The swallow family 

We now consider 2D families £ = exp 1 (V) that have non-exposed faces in the mean value set. By 
Lemma 2.4 this corresponds to an angle <p(V) E (0, 7r/3), see (12). We calculate closures of £ and we 
prove cl geo (£) Q cl r i(£) = £. 

Calculations become easy for <p = arccos(-i/|) « 0.28-7T and we then call £ the swallow family: 



Definition 3.10. The swallow family is the linear exponential family in A := Mat(2,C) C defined 
by 

£ := exp-L (span K {cri 1, 02 1}) . 
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Figure 5: An e-geodesic model of the swallow family £. The mean value set below £ indicates 
the alignment of the canonical tangent space V. The cone C containing £ has apex O2 © 1 
and the generating lines [p(0), O2 © 1] and [p(f ), O2 © 1] (black) belong to the rl-closure of £ . 
Their lower corners p(0) and p(f ) do not belong to the geodesic closure of £ , they project to 
the non-exposed faces of the mean value set. 



The canonical tangent space V = of £ is spanned by v\ := u\ © 1 — gfl and t>2 := 02 © 1 — |l. 
The vector z = — i]l 2 © 1 is perpendicular to v 2 — V\, so indeed 

y = -4(V, z) = 2<(«i + v 2 ,z) = arccos(w §) . 

In Lemma 2.4 we have seen that this angle produces non-exposed faces at the mean value set mv(V). 
The swallow family is included in the 3D cone C defined in (14). We have mv(V) = 7iv(C) and the 
points p(0) = J(l2 + <r 2 ) © and p(f ) = §(H.2 + ci) © on the base circle of C project to the 
non-exposed faces Try(p(0)) and 7ry(p(|)) of the mean value set, see Proposition 3.11. They are also 
significant because we prove in Theorem 3.12 that they are missing in the geodesic closure cl geo (£) 
while they belong to rl-closure cl r i(£). The swallow family is depicted in Figure 5. 

Proposition 3.11. IfV denotes the canonical tangent space of the swallow family, then {7ry(p(0))} 
and {7Ty(/o(f ))} are non-exposed faces of the mean value set mv(V). 

Proof: The mean value set is the projection of the cone C to V. It is the convex hull of the ellipse 
k, which is the projected base disk of C, and of the point x = ttv(02 © 1). which is the projected 
apex, see (19). Let us denote the boundary of k by e. Since ip rj 0.287T < |, we have x $. k and the 
tangent points t\ and t 2 from itoe are non-exposed faces of mv(V^) by Lemma 2.4. We calculate t\ 
and £2 using the polar of x with respect to the conic e (in the sense of projective geometry): we have 
to find on the real space V a symmetric bilinear form j3 including linear and constant terms, such that 
e = {v £ V I f3(v,v) — 0}. Then the polar {v E V | /3(v,x) = 0} of x intersects e in t\ and t 2 , see 
e.g. Section 3.5.11 in Fischer [Fi85]. 

The ellipse e consists of the projected base circle points {7ry(p(a)) | a € M.} of the cone C. We 
can neglect the projection to V by using the linear forms r\ : A sa — >• K., a n- (a, v\) and £ : A sa —t K, 
a h^ (a, 1*2)1 which are invariant under iry- These evaluate for a € R as 

r] (p(o)) = sin(a) — I and £(/°( a )) — cos(a) — | . 
The bilinear form 

: Aa x Aa — > M, (a, 6) ^™> »?(a)r?(6) + C(o)$(6) + |fa + 0(« + 6) - 5 
satisfies /3 (p(a), p(a)) = for all a e M and it satisfies /3(p(0), 2 © 1) = /3(p(f ), 2 © 1) = 0. □ 

We denote the open segment between a,b E A sa by ]o, b[ := {(1 — X)a + Xb \ A E (0, 1)}. 
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Theorem 3.12. The geodesic closure of the swallow family £ is the union of £ with the rank-two states 
in the segments]p(0),0 2 ®l[and}p(^),0 2 ®l[ancl with the pure states 2 ffil and {p(a) | f <a< 2n}. 
The rl- and norm closures are cl r i(£) = £ = cl geo (£ ) U {p(0), p(f )}■ 

Proof: We repeat some arguments from Theorem 3.8. First we calculate the geodesic closure 
cl geo (£) using Proposition 3.3. For a £ R we have the orthogonal sum 

u(a) := sin(a)Oi© 1) + cos(a)(cr 2 © 1) = p{a) - p(a + n) + 2 © \/2cos(a - f ) . 

The maximal projectors are 

p := p+(u(0)) = p(0) + 2 © 1 , q := p+(u(|)) = p(|) + 2 © 1 , 

p+(u(a)) = 2 © 1 for < a < § and p + (u(a)) = p(a) for § < a < 27r. We notice that 
p(ci © l)p = 2 © 1 and this implies that the projection c p : A — >• p*4p = C 2 restricted to V has the 
image M(0 2 © 1 — p(0)) = R(l, — 1) C C 2 . The analogue arguments apply to q, so the exponential 
family 

£ p = ]p(0),0 2 ©l[ resp. f« = ]p(f ),0 2 © 1[ 

in p„4p resp. <7„4.q consists of the invertible states in the compressed algebra pAp resp. qAq. All other 
maximal projectors r of elements of v ^ of V have rank one and produce the exponential family 
£ r = {exp^ o c r (8) | 9 £ V} = {r}. This completes the calculation of the geodesic closure. 

In the second step we prove that the points p(0) and p(f ) missing in the geodesies closure belong 
to the rl-closure of the swallow family. By Lemma 2.1 we have the exposed face F(S(A),u(0)) = 
[p(0),0 2 © 1], which contains the state p(0). Then we obtain from Proposition 3.5 

d £ (p(Q)) = d £P (p(0)) . 

We may use the e-geodesic g : A H> exp^(Ap(0)) to prove that the state p(0) lies in cl gco (£ p ). So 
d£p(p(Q)) = holds and this implies ds(p(0)) = and p(0) € cl r i(£). The analogue arguments show 

p(f)ecl rI (£). 

In the last step we use Lemma 3.1 which says that the norm closure £ is included in the union of £ 
and the exposed faces of the state space S(A) by vectors v in V. These face are shown in Lemma 2.1 to 
depend only on the maximal projector of v, so we use u(a) instead. The exposed faces F(S(A), u (a)) 
are discussed above and we have £ = cl r i(£). □ 

Remark 3.13. • One can establish cl r i(£) = £ for any 2D exponential family £ in Mat(2, C) © C 

with canonical tangent space V and ip = t£(V, z) ^ ?. This is proved by arguments analogous 
to Theorem 3.12 if <p < ^ and becomes even simpler for ip > | where £ \ £ is the base 
circle {p(a) \ a £ [0, 2ir)} of the cone C. So the discussion following (2) shows that the 
Staffelberg family is indeed the unique exponential family in the metamorphosis of Figure 2 with 
a discontinuous entropy distance. 

• There is no e-geodesic in the swallow family £ that meets p(0) asymptotically. Calculation of 
cl r i(£ ) in Theorem 3.12 is done by two limits of e-geodesics. One is implicit in the equation 
df(p(0)) = d£ P (p(0)) where e-geodesics in £ with limit on £ p —}p(0), 2 ffil[ were used. Only the 
second geodesic g in £ p meets p(0) asymptotically. In the projection to V this two-stage analysis 
corresponds to a description of the poonem 7Ty(/?(0)) of the mean value set mv(V) — ttv(C) by 
the exposed faces n v (C) D ir v ({p(0), 2 © 1]) D n v (p(0)), see Figure 3. 

4 Maximizers of the entropy distance 

In applications to neurosciences and statistical physics it is desirable to know the local maximizers of 
entropy distance ds from the exponential family £, see e.g. Ay and Knauf [AK06]. We can show that 
a local maximizer p of ds carries a clear imprint from its projection tts{p) t0 £■ This generalizes the 
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classical case, where p is the conditional probability distribution of tt £ (p) conditioned on its own support 
supp(p) 

p = tt £ (p)( ■ \supp(p)) . (28) 

We use differential calculus in a matrix algebra (see e.g. Bhatia [Bh97] for introductory literature) and 
compute directional derivatives of d £ at a state along all the directions of its face. We obtain a quantum 
analogue of (28). 

Remark 4.1. In the probabilistic case the assertion (28) was proved for a local maximizer p E £ + V 1 - 
of d £ by Ay [Ay02]. All (one-sided) directional derivatives of d £ were computed by Matus [Ma07] in 
the probabilistic case. Beyond this more difficult question, Section 5 in the latter reference contains 
further characterizations of a local maximizer that wait to be examined in the quantum case. 

The support projector of a state p E S(A) is the orthogonal projector p E A with the same image 
Im(p) = Im(p). The von Neumann entropy of p is S(p) := — tr(pln(p)). We denote functions in pAp 
by a superscript, see the paragraph following (22). 

Theorem 4.2. Suppose A is a *-subalgebra of Mat(N,C) and £ is a linear exponential family in A. 
Let p E £ + V 1 - be a state in A, let p denote the support projector of p and put 9 := lnrj OTre(p) G V. 
If u is a traceless self-adjoint matrix in pAp, then ~D\ p de(u) = (u, ln p (p) — 6). If p is a local maximizer 
ofd £ , then p = cxj> p { P 9p) and d £ {p) = F(6) - FP{p9p). 

Proof: We have discussed in the paragraph following (9) that the projection ir £ defined for a E 
£ + V 1 - by intersection a H> (a + V- 1 ) n £ is real analytic. We obtain the real analytic mapping 

L : £ + V 1 - — > V, a i — ► ln o n £ (a) . 

The projection of a is written ii£(a) = expj oL(a). It can be used to describe the entropy distance (5) 
of a state p E £ + V 1 - from £ by 

d e {p) = S( Pl ir £ (p)) = 5(p,cx Pl oL(p)) (29) 

= -S(p) -tT(phxo&q? 1 oL(p)) = -S{p)-ti(pL(p)) + FoL{p) 

with the free energy F. As the state p is positive and invertible in the algebra pAp, we can differentiate 
at p the logarithm ln p in pAp in the direction of any self-adjoint matrix u E pAp. A formula in [Li73] 
reads 

D| p ln» = /"(p + spJ-Vp + ap)" 1 ^ 

and we get by reordering under the trace 

B\ p S(u) - -<u,ln*(p))-tr(u). 

Using the derivative of the free energy (23), which is for a,b E A given by D\ a F(b) — (b, exp 1 (o)), 
the chain rule leads to 

D\ p (FoL)(u) = D\ L{p) FoD\ p L(u) 

= <D| p L( U ),ex Pl oL(p)) = (D\ p L(u),ir £ (p)) . 

Since the image of L is V we have DLi(u) E V and thus by definition of the projection ir £ follows 
(D\ p L(u),ire{p) — p) =0. Differentiation of (29) in the direction of a traceless self-adjoint matrix 
u E pAp gives 

B\ p d £ (u) = (u,ln p (p)) +tt(u) - (u,L(p)) - (p,D\ p L(u)) 
+(T>\ p L(u),7t e (p)) = {uM p { P )~L{p)). 

This completes the asserted directional derivative. 

If p is a local maximizer of d £ , then ln p (p) = pL(p)p + Xp for some real A because p spans the 
orthogonal complement of the space of traceless self-adjoint matrices in pAp. If follows that p must 
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be proportional to pexp(pL(p)p) as claimed. If we write 9 := L(p) = \n oTr £ (p), then we have 
p = cxp\(p9p) and ir £ (p) — cxp 1 (9). We get 

d £ (p) = S(p,TT £ (p)) = tr{p(\n p (p)-lnon £ (p))} 

= tr [p{p9p — plnotr o exp p (p Op) — 9 + Mnotr o cxp(#))] 

= ln(tr(e 9 ))-ln(tr(pe p9p )). D 

Acknowledgment: SW thanks the organizers of the DFG research group "Geometry and Complexity 
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